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Devices, systems, and techniques for data collecting network 
such as wireless sensors are disclosed. A described technique 
includes detecting one or more remote nodes included in the 
wireless sensor network using a local power level that con- 
trols a radio range of the local node. The technique includes 
transmitting a local outdegree. The local outdegree can be 
based on a quantity of the one or more remote nodes. The 
technique includes receiving one or more remote outdegrees 
from the one or more remote nodes. The technique includes 
determining a local node type of the local node based on 
detecting a node type of the one or more remote nodes, using 
the one or more remote outdegrees, and using the local out- 
degree. The technique includes adjusting characteristics, 
including an energy usage characteristic and a data compres- 
sion characteristic, of the wireless sensor network by selec- 
tively modifying the local power level and selectively chang- 
ing the local node type. 
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Algorithm Distributed Heuristic for Minimum Set Cover 
1: Broadcast a pilot signal adding the power level used as information. 

2: Run a timeout and listen to all neighbor’s pilot signals. 

3: Broadcast an ACK message at the maximum power of ail received pilots. 
4: Outdcgrco = Number of ACK’s received. 

S: Broadcast Outdegree. 

6: Receive neighbor’s outdegree. 

7: while still unassigned do 
8; Run a random timeout, 

9: while timeout > 0 do 

10: Listen to decisions of neighbors. 

Li: if One neighbor becomes raw data node then 

12: Stop timeout 

13: Become aggregating node 

14: Broadcast decision 

15: end if 

16: end while 

17: if There are still unassigned neighbors then 

18: if Outdegree > Unassigned neighbor’s outdegree then 

19: Become raw data node. 

20: Broadcast decision. 

21 : else 

22: Restart timeout. 

23: end if 

24: else 

25: All neighbors are Aggregating Nodes 

26: Become raw data node. 

27: Broadcast decision. 

28: end if 

29: end while 


Fig. 5 
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Sc if Request received then 
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DISTRIBUTED TRANSFORMS FOR 
EFFICIENT DATA GATHERING IN SENSOR 
NETWORKS 

PRIORITY CLAIM 5 

This patent document claims the benefit of the U.S. provi- 
sional application No. 61/363,592 entitled “Efficient Com- 
munications and Data Compression Algorithms in Sensor 
Networks Using Discrete Wavelet Transform” and filed on 10 
Jul. 12, 2010, which is incorporated herein by reference in its 
entirety. 

FEDERALLY SPONSORED RESEARCH OR 
DEVELOPMENT 

This invention was made with government support under 
Contract No. AIST-05-0081 awarded by NASA and under 
Contract No. CCF-1018977 awarded by the NSF. The gov- 20 
emment has certain rights in the invention. 

BACKGROUND 

A sensor network can include a set of sensors or sensing 25 
nodes that are capable of sensing, communicating, and pro- 
cessing. An early example of sensor networks is a network of 
acoustic sensors deployed at the ocean bottom to detect and 
keep track of submarines. In other examples, sensors can be 
used to perform various measurements (e.g., temperature or a 30 
presence of a target substance) or capture images for various 
applications. Disposable sensors with processing capabilities 
can be deployed in a number of environments to perform tasks 
such as target tracking (e.g. vehicles, chemical agents, or 
personnel), traffic control, environment monitoring and sur- 35 
veillance. Such sensors can be, for example, wireless sensors 
to wirelessly transmit or receive signals. 

SUMMARY 

40 

Communication, collection and processing of data from 
such sensor networks require communication bandwidths to 
carry the data and consume energy. Hence, it may be desirable 
to reduce the amount of data to be communicated and trans- 
mitted and to reduce the energy consumed. In many applica- 45 
tions, sensors can collect data at different locations, such that 
the information is correlated across locations, e.g., some 
closely located sensors. As an example, temperatures mea- 
sured by temperature sensors near one another may be corre- 
lated so there is certain redundancy in the individual measure- 50 
ments obtained by these separate sensors. As a result, some 
unnecessary data may be transmitted through the network. 

The redundancy in the data from sensors may be reduced or 
removed via signal processing at the sensor level to transform 
the raw data collected by the sensors. The transformed data 55 
with the reduced redundancy may be communicated through 
the sensor network. This reduction in the amount of data 
reduces the energy consumed in transmitting the data through 
the sensor network because the transmitted data is less than 
the raw data collected by the sensors. However, the process- 60 
ing at the sensor level for reducing the data redundancy con- 
sumes energy. It is possible that the total energy consumed in 
processing the raw data and the transmission of the processed 
the data with a reduced amount of data bits may not be less 
than the energy consumed for directly transmitting the raw 65 
data. Hence, the data processing mechanism in processing the 
raw data at the sensor level to reduce data redundancy should 
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be designed according to the specific structure sensor net- 
work to reduce the overall consumption of energy. 

Systems and techniques for wireless sensor networks can 
include mechanisms for eneigy-aware compression optimi- 
zation using distributed energy-efficient set covers. Systems 
and techniques for wireless sensor networks can include 
mechanisms for unidirectional graph-based wavelet trans- 
forms for sensor network data collections. 

In one aspect, a technique for data collecting network such 
as wireless sensors includes detecting one or more remote 
nodes included in the wireless sensor network using a local 
power level that controls a radio range of the local node. The 
technique includes transmitting a local outdegree. The local 
outdegree can be based on a quantity of the one or more 
remote nodes. The technique includes receiving one or more 
remote outdegrees from the one or more remote nodes. The 
technique includes determining a local node type of the local 
node based on detecting a node type of the one or more remote 
nodes, using the one or more remote outdegrees, and using the 
local outdegree. The technique includes adjusting character- 
istics, including an eneigy usage characteristic and a data 
compression characteristic, of the wireless sensor network by 
selectively modifying the local power level and selectively 
changing the local node type. Other embodiments of this 
aspect include corresponding systems, apparatus, and com- 
puter programs encoded on computer storage devices are 
possible. 

In another aspect, devices and systems can include trans- 
ceiver electronics to communicate using a local power level 
that controls a radio range; and processor electronics config- 
ured to detect one or more nodes of a network based on the 
local power level, (control the transceiver electronics to trans- 
mit a local outdegree to the one or more detected nodes, the 
local outdegree being based on a quantity of detected nodes, 
receive one or more remote outdegrees from the one or more 
detected nodes, determine a local node type of the apparatus 
based on the local outdegree, the one or more remote outde- 
grees, and one or more detected node types corresponding to 
at least a portion of the one or more detected nodes, respec- 
tively, and adjust characteristics, including an eneigy usage 
characteristic and a data compression characteristic, of the 
network by selectively modifying the local power level and 
selectively changing the local node type. The processor elec- 
tronics can be configured to cause an increase of the local 
power level to increase the radio range to reach one or more 
additional nodes and control a transmission of an additional 
signal at the increased local power level to cause the one or 
more additional nodes to become an aggregation type, where 
the local node type is a raw type, where a node of the aggre- 
gation type aggregates and compresses raw data. 

In another aspect, a system can include one or more first 
nodes configured to collect data, route data, or both; and a 
second node configured to collect data, route data, or both. 
The second node can be configured to perform operations that 
include detecting one or more of the first nodes using a local 
power level that controls a radio range of the second node; 
transmitting a local outdegree, the local outdegree being 
based on a quantity of the one or more detected nodes; receiv- 
ing one or more remote outdegrees from the one or more 
detected nodes; determining a node type of the second node 
based on information including the local outdegree, the one or 
more remote outdegrees, and one or more detected node types 
corresponding to the one or more detected nodes, respec- 
tively; and adjusting characteristics, including an energy 
usage characteristic and a data compression characteristic, of 
the system by selectively modifying the local power level and 
selectively changing the node type of the second node. 
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These and other aspects can include one or more of the 
following features. Determining the node type of the second 
node can include exchanging messages with at least a portion 
of the detected nodes, the messages indicating a node type, 
selectively assigning the node type of the second node as a 5 
raw type based on information including the local outdegree, 
the one or more remote outdegrees, and one or more of the 
messages, and selectively assigning the node type of the 
second node as an aggregation type based on a neighbor node 
being of the raw type, where a node of the raw type sends raw to 
data to a node of the aggregation type, where the node of the 
aggregation type aggregates and compresses raw data. 
Detecting one or more of the first nodes can include transmit- 
ting a pilot signal based on the local power level, the pilot 
signal including an indication of the local power level ; receiv- 1 5 
ing one or more pilot signals from at least a portion of the first 
nodes; transmitting an acknowledgement message based on a 
maximum power level indicated by the one or more received 
pilot signals; and receiving one or more acknowledgements 
from at least a portion of the first nodes. Adjusting the char- 20 
acteristics can include increasing the local power level to 
increase the radio range to reach one or more additional 
nodes; and transmitting an additional signal at the increased 
local power level to cause at least a portion of the one or more 
additional nodes to become the aggregation type. A node of 25 
the aggregation type can be configured to perform a portion of 
a distributed wavelet transform to generate compressed data. 
Adjusting the characteristics can include performing an 
approximation-based distributed minimum set cover modifi- 
cation algorithm to change one or more node type assign- 30 
ments of nodes within the system to increase a data compres- 
sion ratio. Performing the algorithm can include increasing 
the local power level to increase the radio range to reach one 
or more additional nodes. 

The details of one or more implementations are set forth in 35 
the accompanying attachments, the drawings, and the 
description below. Other features will be apparent from the 
description and drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 40 

FIG. 1 shows an example of a process for adjusting energy 
and compression characteristics of a network. 

FIG. 2 shows another example of a process for adjusting 
energy and compression characteristics of a network. 

FIG. 3 shows an example of a block diagram of a node 
included in a wireless sensor network. 

FIG. 4 shows an example of a quantized radii construction. 

FIG. 5 shows an example of an algorithm based on a 
distributed heuristic for a minimum set cover. 

FIGS. 6A, 6B, 6C, and 6D shows examples of distributed 
set-cover modifications. 

FIG. 7 shows an example of an approximation-based dis- 
tributed minimum set cover modification algorithm for raw 
data nodes. 

FIG. 8 shows an example of an approximation-based dis- 
tributed m inimu m set cover modification algorithm for aggre- 
gating nodes. 

FIGS. 9A, 9B, 9C, and 9D shows different example con- 
figurations of a wireless sensor network based on different 
optimization techniques. 

FIG. 10 shows examples of raw-costs and total costs for 
different set covering algorithms. 

FIG. 11 show examples of cost-distortion curves for dif- 
ferent set covering algorithms. 

FIG. 12 shows an example of an algorithm for choosing a 
greedy set cover in an unweight directed graph. 
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FIG. 13 shows an example of an algorithm for choosing a 
greedy set cover in a weighted vertex directed graph. 

FIG. 14 shows an example of cost comparisons of different 
lifting schemes. 

FIG. 15 shows an example of raw data transmissions 
required by different lifting schemes. 

FIGS. 16A and 16B show examples of different transfor- 
mation structures. 

FIG. 17 shows an example comparing energy consumption 
for different lifting schemes. 

DETAILED DESCRIPTION 

This document describes, among others, implementations 
of data processing and communicating systems and tech- 
niques based on distributed wavelet compression algorithms 
for processing data at the sensor level to decorrelate data of 
the sensors and to reduce the energy consumption in sensor 
networks such as sensor networks with wireless sensors. 

Sensor network nodes can run one or more algorithms that 
optimize spatial compression in a wireless sensor network 
(WSN) on one or more arbitrary communication graphs. In a 
spatial compression technique, some nodes such as raw nodes 
are required to transmit raw data before spatial compression 
can be perforated. Nodes that receive raw data such as aggre- 
gating nodes can perfonn spatial compression based on infor- 
mation including data received from raw nodes. 

A spatial compression scheme can use a raw aggregating 
node assignment (RANA) to enable compression. Since 
transmitting raw data bits in a WSN may require more bits 
than transmitting compressed data, a RANA for a WSN can 
be determined which minimizes the number of raw nodes in 
the network. In some implementations, nodes can choose 
different radio ranges for data transmission. The problem of 
optimally selecting raw nodes and their radio ranges can be 
formulated as a set cover problem, which can be solved via a 
linear programming based solution. In some implementa- 
tions, distributed optimization algorithms can be performed 
by at least some nodes of a WSN. 

In some implementations, sensor network nodes can per- 
form a lifting-based wavelet transform for any arbitrary com- 
munication graph. In some implementations, nodes can per- 
form one or more unidirectional graph-based wavelet 
transforms which are computed as data are forwarded 
45 towards the sink on a routing tree. 

A WSN can optimize a RANA by assigning as raw data 
nodes with greater outdegree in the network and force their 
neighbors to become aggregating nodes. This can be per- 
formed in a distributed way by allowing each node to 
50 exchange a few messages with its neighbors and then decides 
between becoming a raw or aggregating node. A technique for 
RANA optimization can include determining the number of 
neighbors at each node, exchanging outdegree information, 
and performing sequential raw aggregating assignment. In 
55 some implementations, a WSN can perform a distributed 
heuristic for minimum set cover. 

A wireless sensor node can include processor electronics 
such as a microprocessor that implements methods such as 
one or more of the techniques presented in this document. A 
60 node can include transceiver electronics to send and/or 
receive wireless signals over one or more communication 
interfaces such as an antenna. A node can include other com- 
munication interfaces for transmitting and receiving data. A 
node can include one or more memories configured to store 
65 information such as data and/or instructions. 

Nodes in a WSN can be energy-constrained devices. The 
WSN can collectively perfonn in-network compression for 
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energy-efficient data gathering and reporting. In the context 
of compression schemes which use distributed spatial trans- 
forms, an objective is to compress the data into transform 
coefficients that require as few bits as possible to transmit. 
Distributed spatial transforms can use spatial transforms to 5 
de-correlate data across neighboring nodes, leading to repre- 
sentations which require a smaller number of bits than that 
needed for representing raw data. This ultimately reduces 
communication costs and power consumption. 

A WSN can use one or more compression schemes to to 
reduce costs associated with transmitting sensor data within 
the WSN to one or more sink nodes. Various examples of 
compression schemes for data-gathering include cluster- 
based compression and routing-driven compression. In clus- 
ter-based compression, the network is divided into clusters 15 
and nodes in each cluster route data to a designated cluster- 
head, which compresses the data received from nodes in its 
cluster and forwards the compressed result to the sinks. 
Clearly in this case joint compression of data aggregated on 
the cluster-head should lead to very few bits required to 20 
represent it. Note that here cluster-heads act as aggregating 
nodes, and others serve as raw nodes. In the routing-driven 
compression schemes, raw nodes first transmit raw data to 
their neighbors, then each neighbor compresses its own data 
using any raw data that it receives and forwards the com- 25 
pressed result to the sink. Nodes in a network can use a 
lifting-based wavelet transforms. These wavelet transforms 
are useful in distributed data gathering applications because 
they have been shown to provide good coding efficiency 
while leading to simple constructions for arbitrary network 30 
configurations. These transforms are constructed by first 
dividing nodes into disjoint sets of even and odd nodes. Data 
from odd nodes are then predicted using data from neighbor- 
ing even nodes, yielding detail coefficients. Then, data from 
even nodes are updated using detail coefficients from neigh- 35 
boring odd nodes, yielding smooth coefficients. Lifting trans- 
forms are invertible by construction and only require assign- 
ing to each node a role (even or odd). Moreover, the lifting 
procedure facilitates distributed computations which are 
localized, e.g., nodes can compute transform coefficients 40 
using only data received from their direct (1-hop) neighbors. 
Note that in a lifting transform, even nodes first transmit raw 
data to their odd neighbors, then odd neighbors use this data 
to compute detail coefficients. Thus, even nodes serve as raw 
nodes and odd nodes serve as aggregating nodes. 45 

This document describes, among other tilings, distributed 
transforms for data-gathering applications for arbitrary net- 
works that achieves significant gains over raw data transmis- 
sion, while requiring minimal coordination between nodes. In 
most spatial compression schemes, some nodes (e.g., raw 50 
nodes) are required to transmit raw data before spatial com- 
pression can be performed. Nodes that receive raw data (e.g., 
aggregating nodes) can then perform spatial compression. 
Such schemes use a RANAto enable compression. Typically, 
transmitting raw data requires more bits than transmitting 55 
compressed data. Some WSNs use RANAs that select raw 
nodes to minimize overall energy consumption in the net- 
work. In some implementations, the problem of optimally 
selecting raw nodes can be formulated as a set cover problem, 
which can be solved in a distributed fashion for a variety of 60 
scenarios, including single-sink, multi-sink and gossip-based 
networks. 

Distributed data-processing methods can be implemented 
for any domain where the underlying relations between data 
locations can be represented by a weighted graph. The data 65 
defined on the vertices of an arbitrary network, for example, 
can be naturally modeled as a weighted graph. Signals such as 
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those coming from imaging sensors can be interpreted as data 
(e.g., pixel intensities) on a lattice graph. The links on the 
graph can be binary (e.g., they exist or do not exist) or can 
represent a cost (e.g., bandwidth, latency, or energy) of 
exchanging data across the link. Distributed data-processing 
methods are desirable in a wide variety of applications where 
the nodes (data sources) exchange data to perform some 
coordinated task and where global centralized operations are 
impractical either due to the massive size of the data set or the 
cost of data exchanges. It is therefore advantageous to 
develop algorithms that can optimize the amount of data 
communication that is required for a given application. WSN 
can perform data gathering with in-network compression in 
which nodes send their data to single or multiple sinks, or to 
all other nodes in the network. The simplest form of data 
gathering is to have all nodes send raw data to the sink(s). In 
some implementations, raw data refers to data for which no 
spatial processing (e.g., compression) lias been performed. 
Decorrelating before encoding the data is useful since it can 
lead to data representations requiring fewer bits than what is 
needed to represent raw data. This will reduce the amount of 
data that nodes transmit to the sinks, and ultimately reduce the 
total cost of data-gathering in the network. Since bit-rate can 
be significantly reduced by compression, the transmission 
cost associated to data produced by aggregating nodes will be 
significantly lower than the cost associated to raw data trans- 
mission. 

Distributed compression schemes that exploit spatial cor- 
relation can require an assignment of nodes as either raw or 
aggregating (e.g., RANA) and one or more techniques for 
compressing data at aggregating nodes. A WSN can find 
RANAs that minimize total energy consumption under dif- 
ferent scenarios such as single-sink (“all-to-one”), multiple 
sink (“all-to-few”) or gossip (“all-to-all”) cases. A WSN can 
use one or more unidirectional lifting transforms, which will 
be described later, for compressing data. The raw-aggregating 
node assignment techniques can be applied a broader class of 
two-channel transforms where nodes in each channel are 
assigned a different transform. A WSN can be used for dis- 
tributed image/video gathering, where raw nodes can trans- 
mit images to aggregating nodes, which would then encode 
them jointly. Cost functions used for RANA can be based on 
the type of data and compression algorithms used in the WSN. 

FIG. 1 shows an example of a process for adjusting energy 
and compression characteristics of a network. At 105 , the 
process includes detecting a presence of one or more remote 
nodes included in the wireless sensor network using a local 
power level that controls a radio range of the local node. 
Detecting the presence of one or more remote nodes can 
include transmitting a pilot signal based on the local power 
level, the pilot signal comprising an indication of the local 
power level, receiving one or more pilot signals from the one 
or more remote nodes, transmitting an acknowledgement 
message based on a maximum power level indicated by the 
one or more received pilot signals, and receiving one or more 
acknowledgements from one or more of the remote nodes. 
The words “local” and “remote” are used to distinguish 
between different nodes. For example, with respect to a local 
node (which can be any node in a network), other nodes (that 
are not the local node) in a network are referred to as remote 
nodes. 

At 110, the process includes transmitting a local outdegree, 
the local outdegree being based on a quantity of the one or 
more remote nodes. At 115 , the process includes receiving 
one or more remote outdegrees from the one or more remote 
nodes. 
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At 120, the process includes dete rminin g a local node type 
of the local node based on detecting a node type of the one or 
more remote nodes, using the one or more remote outdegrees, 
and using the local outdegree. A node type can be a raw type 
or an aggregation type. A node of the raw type can send raw 5 
data to a node of the aggregation type. A node of the aggre- 
gation type aggregates and compresses raw data. Hie node of 
the aggregation type can be configured to perform a portion of 
a distributed wavelet transform to generate compressed data. 

In some implementations, at 120, determining the local to 
node type can include exchanging messages with at least a 
portion of the one or more remote nodes, the messages indi- 
cating a node type. Determining the local node type can 
include selectively assigning the local node type as a raw type 
based on information comprising the local outdegree, the one 15 
or more remote outdegrees, and one or more of the messages. 
Determining the local node type can include selectively 
assigning the local node type as an aggregation type based on 
a neighbor node being of the raw type, wherein a node of the 
raw type sends raw data to a node of the aggregation type, 20 
wherein the node of the aggregation type aggregates and 
compresses raw data. 

At 125, the process includes adjusting characteristics, 
including an energy usage characteristic and a data compres- 
sion characteristic, of the wireless sensor network by selec- 25 
tively modifying the local power level and selectively chang- 
ing the local node type. Adjusting the characteristics can 
include performing an approximation-based distributed mini- 
mum set cover modification algorithm to change one or more 
node type assignments within the wireless sensor network to 30 
increase a data compression ratio. Performing the algorithm 
can include increasing the local power level to increase the 
radio range of the local node to reach one or more additional 
remote nodes. 

FIG. 2 shows another example of a process for adjusting 35 
energy and compression characteristics of a network. At 205, 
the process includes exchanging messages with remote 
nodes, the messages indicating a node type. At 210, the pro- 
cess includes selectively assigning the local node type as a 
raw type based on information including the local outdegree, 40 
the one or more remote outdegrees, and one or more of the 
messages. At 215, the process includes selectively assigning 
the local node type as an aggregation type based on a neighbor 
node being of the raw type. At 220, the process includes 
performing a distributed modification technique to adjust 45 
node type assignments. Performing a distributed modifica- 
tion technique can include increasing the local power level to 
increase the radio range of the local node to reach one or more 
additional remote nodes; and transmitting an additional sig- 
nal at the increased local power level to cause the one or more 50 
additional remote nodes to become the aggregation type. 

FIG. 3 shows an example of a block diagram of a node 
included in a wireless sensor network. A node 300 can include 
transceiver electronics 305, processor electronics 310, and a 
memory 315. The transceiver electronics 305 can be coupled 55 
with one or more antennas 320 to transmit and receives sig- 
nals. Processor electronics 310 can include one or more pro- 
cessors that implement one or more techniques presented in 
this disclosure. The processor electronics 310 can include a 
central processing unit (CPU), an application specific inte- 60 
grated circuit (ASIC), a field-programmable gate array 
(FPGA), or another suitable component. In some implemen- 
tations, transceiver electronics 305 include integrated trans- 
mitting and receiving circuitry. The node 300 can include one 
or more memories 315 configured to store information such 65 
as data, instructions, or both. In some implementations, node 
300 includes dedicated circuitry for transmitting and dedi- 
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cated circuitry for receiving. The node 300 can include one or 
more sensor probes 325 for collecting data such as tempature, 
pressure, video, audio, radiation measurements, and RF 
energy measurements. Other types of probes are possible. 

1 Distributed Data Gathering Techniques 

Let G=(V, E) be a directed communication graph repre- 
senting an arbitrary network with N nodes indexed by nel={ 1 , 
2, . . . , N} and K sinks indexed by keS={N+l, N+2, . . . , 
N+K}. Assume that each edge (m, n)eE denotes a communi- 
cation link from node m to node n, and that each node has to 
transmit its data towards the set of sinks as efficiently as 
possible. Efficiency can be measured in relation to the energy 
consumption, bandwidth utilization, delay, etc. Therefore, 
nodes have to send their data to the sinks following a routing 
strategy, denoted as T=(V, E r ), which minimizes some cost 
metric (i.e., number of communications, energy consump- 
tion, etc.). For instance, an energy-aware routing strategy for 
single-sink networks T can be the well-known shortest path 
tree (SPT). Similarly, in the multiple-sink case, T can be 
defined as the union of the different eneigy-aware multicast 
routing trees, spanning each sensor from all sinks A transmis- 
sion schedule can be defined in which, first of all, raw nodes 
send unprocessed data to their parents in T (there may be 
multiple parents in the multi-sink case). Then, aggregating 
parents and all other aggregating nodes that have received this 
data use it to compute their detail coefficients. After that, both 
raw and compressed data are forwarded to the set of sinks 
following the given routing strategy. Note that in some wire- 
less networks, the communications are of the broadcast 
nature and hence all nodes within the neighborhood of raw 
nodes will receive the raw data. Let g(n) be the cost of routing 
one bit from node n to all sinks following T. 

Assume that raw data is encoded using B,. bits, and c„ is a 
rate-reduction ratio that reflects the level of data compression 
in node n. We denote the set of raw data nodes as D, and the 
set of aggregating nodes as A. Thus, the total routing cost for 
gathering all data in the sinks can be written as 


C(A, D) = B r g(n) + c « B rg(n)- ^ 

meD neA 

We assume c„ to be some constant ce(0, 1 ] , which would be 
the case for dense networks where the coding efficiency is 
fairly uniform everywhere. Note that Z me/ g(m)=Z mlLD g(m )+ 
2„^g(n) since I=DUA and DOA=0. Therefore (1) becomes 


C(fl)=(l- t )6,^{(«) + ^j(»). P) 

mED nE I 

Since B r , c and (1 -c) have constant values and the graph G 
is fixed, which means that cB,Z, IE/ g(n) is also constant, the 
only optimization left is to minimize Z m&D g(m) under the 
constraint that each raw node in D covers at least one aggre- 
gating node in A. Thus, assuming the transmissions between 
nodes are optimized by the routing strategy, finding a mini- 
mum cost transform is equivalent to finding a RANA that 
minimizes the total routing cost of the raw nodes. Note that a 
trade-off arises: reducing the number of raw nodes means that 
the number of neighbors whose data each aggregating node 
can use is also reduced. This may reduce the level of de- 
correlation achieved by aggregating nodes and, hence, the 
number of bits needed to represent detail coefficients may 
increase. 
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1.1 Centralized Optimization 

Assume that sinks know the network topology and have the 
ability to coordinate the operation of other nodes. In such 
case, sinks can run the same centralized RANA optimization 
and collaboratively distribute information about RANA to 5 
nodes in the network. For lifting-based transforms, the 
RANA can be optimized by minimizing 2 m<LD g(m) while 
ensuring that raw nodes in D form a set-cover in G. A single- 
sink case the optimization can be reduced to solving a MW SC 
problem with weights being defined as the penalty cost of to 
assigning each node to the subset D. Note that a single-sink 
network is a particular case of the problem formulated before, 
and therefore we can approximate the optimal RANA for any 
number K of sinks with a centralized greedy algorithm. 

1.2 Distributed Optimization 15 

Now assume that no centralized optimization is possible. 

Thus, nodes will need to coordinate locally to optimize the 
transform. Note that, in such scenario, nodes do not know a 
priori the cost of routing their data towards the sinks, hence, 
the algebraic formulation used for the centralized case is not 20 
longer valid. One example of this scenario would be when 
nodes want to disseminate their data all over the network (all 
to all). Centralized routing strategies for data broadcasting 
(i.e., minimum spanning tree) are impractical since they are 
not robust to changes in the network topology and link or 25 
node failures. For this reason, gossip-based algorithms have 
been widely studied for information processing and data dis- 
semination during the last few years. In gossip-based proto- 
cols, at each iteration, one random node forwards packets to 
one or a few of its neighbors chosen with a certain probability. 30 
The trade-off of these strategies is that they might require 
more communications and time to converge than centralized 
routing approaches. Although applications based on gossip 
are increasingly being proposed, there is a lack of work on 
spatial compression and distributed data gathering in this kind 35 
of networks. With the implementation of our transform as a 
first step, we provide a distributed scheme in which data is 
first decorrelated and then disseminated via standard gossip 
techniques. 

Assuming that the routing cost to broadcast data all over 40 
the network is approximately the same for all nodes (i.e., 
g(m)=g), the minimization term becomes 2„ ItD g=glDI. 
Therefore, the problem is now equivalent to solving a mini- 
mum set-covering (MSC) problem in a distributed manner. 
Once the RANA has been optimized, the lifting transform can 45 
be implemented following the same steps as before with raw 
nodes sending first raw data to their neighbors and then aggre- 
gating neighbors computing detail coefficients with the raw 
data received. This coordination can be performed by defin- 
ing two different timeouts depending on the node’s assign- 50 
ment. Once data has been compressed in aggregating nodes, 
nodes can start broadcasting both raw and transformed coef- 
ficients all over the network following any kind of gossip- 
based strategy. To ensure invertibility in the network, it can be 
assumes that aggregating nodes encapsulate with their detail 55 
coefficients the indexes of the raw nodes they have used for 
compression. 

Various algorithms for optimizing routing-driven com- 
pression schemes have been described herein for distributed 
data gathering in arbitrary networks that minimize the total 60 
energy consumption in the network. This optimization can be 
formed as a set covering problem, where nodes are assigned 
as raw nodes, which form the set cover, and aggregating 
nodes, which are covered by raw nodes. RANA is based on a 
coordination among nodes. In some implementations, a WSN 65 
employs a centralized RANA in which nodes receive their 
RANA from the central nodes (e.g., sinks) which run a cen- 
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tralized RANA optimization. In some implementations, a 
WSN employs a distributed RANA, where nodes coordinate 
to find a distributed approximation of set-cover solution. 
Minimizing raw data exchange can minimize resource utili- 
zation in a network, such as air link resources, and can 
decrease the total energy consumption of the WSN. Minimiz- 
ing raw data exchange can be advantageous for efficient 
resource usage in other applications such as distributed esti- 
mation and detection. Thus, similar types of optimization 
algorithms can be developed with those applications in mind. 

2 Node Assignments and Node Radio Range Optimiza- 
tions 

This document provides, among other tilings, details and 
examples of centralized and distributed algorithms for find- 
ing minimum cost set covers for routing-driven compression 
schemes. These algorithms jointly optimize the choice of 
radio range and node assigmnentA centralized algorithm is 
described that provides a joint selection of radio range and 
RANA. The centralized algorithm can be formulated as a 
linear program and is solved using standard tools. A distrib- 
uted set covering algorithm is described for optimizing a 
RANA in a distributed fashion. The distributed algorithm can 
include one or more iterations in which nodes can modify 
their radio ranges to improve on the RANA. RANA improve- 
ments can include having fewer raw nodes than the RANAs 
previous methods and provide reductions (e.g., up to 25% 
reductions) in total cost as compared to an existing routing- 
driven approach and provide reductions (e.g., up to 65% 
reduction) in total cost with respect to raw-data transmis- 
sions. Such optimizations can be applied to cluster-based 
compression schemes. 

For a given RANA, there is a trade-off Reducing the num- 
ber of raw data nodes (by increasing the radio range of some 
nodes) means that the average distance between raw nodes 
and aggregating nodes will increase. This increase in distance 
(due to an increase in radio range) tends to reduce the level of 
correlation of data received by aggregating nodes, which in 
turn leads to potentially lower coding efficiency (i.e., higher 
rate will be required to represent the data produced by aggre- 
gating nodes). On the other hand, if we wish to ensure that the 
distance between raw and aggregating nodes is low, we may 
need to add more raw nodes. A WSN can optimize the number 
of nodes which send raw data while taking into account this 
trade-off. The WSN can use raw-aggregating node assign- 
ments which effectively exploit this trade-off by jointly opti- 
mizing the radio range at each node with the node assignment. 

2.1 General Formulation 

A optimization technique can use a general cost function 
for distributed compression in WSNs. Minimizing such a cost 
function can be equivalent to solving a set-cover problem on 
graph. For example, a network can include N nodes and a sink 
(indexed by N+ 1 ), where each node can collect some raw data 
x(n) that it needs to forward to the sink. Let B ; . denote the 
number of bits used to represent the raw data x(n). Assume 
that each node (indexed by nel={l, 2, . . . ,N}) transmits using 
a radio range of R„ and let G=(V, E) be the directed commu- 
nication graph which results from these choices of radio 
ranges. The set of chosen radio ranges for all nodes in I is 
denoted by R={R„} (me{1> 2 , . . . jv})- 

The set of raw-data nodes is denoted by D The set of 
aggregating nodes in the network is denoted by A. Let Njj(n) 
and N [fi] (n) denote the open and closed neighborhoods 
respectively of node n with R radio range. Thus 
N[j](n)={n}UN s (n) where N fi (n) is the set of all nodes that 
can hear transmissions from n. For a fixed graph (e.g., given 
routing tree) when radio ranges of nodes are fixed, N„ N /; (n) 
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and N m N| /( j(n) with R„ being the fixed radio range required 
to forward data of node n to its parent along the routing tree. 

Furthermore, let T=(V, E') denote a routing tree rooted at 
the sink node. Such a tree can be constructed using standard 
routing protocols such as the Collection Tree Protocol (CTP). 5 
The routing tree T is denoted by p(m). Further, let g(n, m) 
denote the cost to route a single bit from node n to node m 
along shortest path in the communication graph G. The cost 
g(n, m) can, for example, be proportional to the number of 
hops, or to the sum of the squared distances between all nodes 10 
along the path from n to node m, etc. In some implementa- 
tions, g(n) denotes the cost to route a single bit from node n to 
the sink along T (e.g., g(n)=g(n, N+l)). In order to cover more 
general case of of variable radio ranges (e.g., where nodes can 
also adjust their radio ranges), the function f( n, R w , m) can be 1 5 
defined as the cost of transmitting a single bit from a node n to 
node m when it has a radio range R„. 

In the general framework, first each raw-node n transmits 
its data x(n) to some processing parent p(n) assigned to n. The 
processing parent p(n) compresses and encodes the received 211 
data of n and transmits it to the sink along the routing tree T. 
The processing parent node can for example be the sink node 
in a routing driven scheme (as the un-encoded data from 
raw-nodes flows all the way to the sink), or it can be an 
aggregating node in cluster based schemes. Thus the cost of 25 
routing data from raw-node n to the sink can be split into (i) 
B,J(n, R„, p(n)), the cost of routing B,. bits of raw-data to the 
processing parent and (ii) c„B,g(p(n)), the cost of routing 
encoded data from processing parent to the sink along short- 
est path. Flere c„ is a rate-reduction ratio for node n’s data, 30 
indicating, number of bits transmitted per single bit of raw- 
data. Thus a lower value of c„ indicates better coding effi- 
ciency for the data at node n. On the other hand, each aggre- 
gating node m listens, receives raw data from its raw node 
neighbors and generates the compressed data, which are then 35 
transmitted to the sink along the shortest path. Therefore the 
cost of routing data from aggregating nodes is given by c„,B,g 
(m). The total routing cost is then equal to the sum of costs of 
routing data from each node: 


C(D, A.R) = Yj B rf ( n • R n ■ P(«» + Yl C " B rglPW) + Y (3) 

neD neD meA 

45 

Note that a given assignment of A and D only makes sense 
if every aggregating node n has at least one raw data neighbor 
from which it can compress its own data, i.e., for all neA, 
N ?; *0. This is equivalent to requiring that every aggregating 
node be “covered” by at least one raw data node, or more 50 
precisely, that U meD N [OT -|=I. In other words, U„ I(LD N [m] pro- 
vides a set cover for the nodes in G and the goal is to find a 
minimum cost set-cover (MCSC) stated in Definition 1 
below. 

Definition 1 Minimum Cost Set Cover Problem: For graph 55 
G=(V, E) define the closed neighborhood j(v) as a disk 
centered at node v with radius R,, and let N be the collection 
of all these disks. A set-cover D N is then a sub collection of 
the sets whose union is V. The MCSC problem is to find a 
set-cover which minimizes the cost function given in (3). 60 

The MCSC problem is a general minimum set cover prob- 
lem and thus NP-hard. In the next subsection we describe 
set-cover formulations for specific cases that will lead us to 
simpler solutions. 

2.1.1 Optimization for Fixed Graph 65 

In this section we describe a simplication of the cost func- 
tion for fixed graph case in which nodes have a fixed set of 
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radio ranges. This can be for instance be the case when nodes 
choose a radio range just large enough to route their data to 
their parent along the routing tree T. 

In the fixed graph case, the data is forwarded along shortest 
path from one node to another and therefore in this case the 
cost to route a single bit from any node n to any node m is ;f (n, 
R„, m)=g(n, m). The cost function in (3) then becomes: 

C(D, A) = ^ B r g(n, pin)) + Y c„B r gipin)) + Y c m B r g(m ) FO 

neD neD meA 

For simplicity, we suppose that the compressed data for 
each node n is encoded using cB,. bits for some constant ce(0, 
1], This could be the case in fairly dense networks since then 
coding efficiency is uniform everywhere. Thus, the amount of 
compression that can be achieved will be about the same for 
all aggregating/processing nodes. The total cost to compute a 
encoded coefficients in a distributed maimer and route the 
resulting coefficients to the sink now becomes 

C(D, A) = Yj B r g(n, p{n)) + Y cB r g(p(n)) + ^ cB r g{n) 

neD neD meA 

Note that 2 m6/ g(rn)=2„, 6D g(m)+2„ 6 . J g(n) since I=DUA and 
DCiA=0. Therefore, (5) becomes 


C(D, A) = Y 1 B r g(n , pin)) + cB r gipin)) - cB r gin)] + ( 6 ) 

neD 

Y cB r g(n) 

me I 

= Y BMn) + Y cB rg(n) 

meD met 

where w(n)=[g(n, p(n))+c(g(p(n))-g(n))] is a generalized 
weight function defined on raw-nodes. This weight function 
w(n) at node n depends on (i) cost of routing node n’s data to 
its processing parent p(n), (ii) rate-reduction ratio c and (iii) 
its routing cost to sink g(n). We will describe shortly the 
weight functions for routing-driven schemes and clustering 
based schemes. Further we observe that B,. and c in (6) are 
constants. Moreover, since the graph G is fixed, c2 JIE/ B r g(n) is 
also constant. Therefore, the only thing left to optimize in (6) 
is Z, ;<LD (B,.w(n)). Thus minimizing (6) under the set-cover 
constraint described above leads to finding a minimum 
weighted set-cover (MWSC) problem stated in Definition 2 
below. 

Definition 2 Minimum Weight Set Cover Problem: For 
graph G=(V, E) denote closed neighborhood n M = 
ii[,,] = {v}U{ueV:vu€E} as a disk centered at node v with cor- 
responding weight g(v) for all nodes vel. Given a collection N 
of all sets {n [v] }, a set-cover D<=N is a sub collection of the 
sets whose union is V. The MWSC problem is to find a 
set -cover with minimum weights. 

2.1.2 Routing-Driven Compression Optimization 

In routing-driven compression scheme nodes are first par- 
titioned into raw nodes in set D and aggregating nodes in set 
A. For these types of schemes, raw nodes transmit their raw- 
data all the way to the sink along routing tree T. Thus in this 
case sink node is the processing parent for all raw-node (i.e. 
p(n)=N+l) and therefore g(n, p(n))=g(n). Further the cost of 
routing data from sink to sink is 0, therefore g(p(n))=0. So the 
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weighted function w(n) for raw-node n in (6) becomes 
w(n)=(l-c)g(n) and (6) can now be written as: 


C(A, D) = (1 - c)B r Y j g(m) + 


Note that B r , c and ( 1 -c) are constants. Moreover, since the 
graph G is fixed, c2 JiE/ B r g(n) is also constant. Therefore, the 10 
only thing left to optimize in (7) is Z mtD g(m), under the 
constraint that raw-nodes form a set-cover (i.e, U m£D N [m] =I) 
Thus, finding a minimum cost transform is equivalent to 
finding an assignment of raw data and aggregating nodes 
which minimizes the average routing cost of the raw nodes 15 
(i.e., 2 m6D g(m)) under the constraint that U mtD N [m] provides 
a set cover for G. This can be formulated as a MW SC problem 
described in Definition 2. 

The solution of the MWSC problem leads to raw-aggre- 
gating assignments which minimize the average routing cost 20 
for raw nodes subject to a fixed graph (i.e., fixed radio ranges 
for each node) by optimizing the node selection. The problem 
of joint optimization (i.e. with respect to both radio range and 
RANA assignments) of cost function can be referred to as a 
Restricted Minimum Weighted Set-Cover (RMWSC) prob- 25 
lem. A WSN can perform a joint optimization to jointly select 
the radio range and node assignment using a linear program 
formulation. The WSN can use distributed approximations in 
this joint optimization. 

2.1.3 Cluster- Based Compression Optimization 

Now consider a cluster-based compression scheme where 
nodes are again partitioned into raw nodes D and aggregating 
nodes A. In these schemes each raw node transmit its raw-data 
to the assigned aggregating node (called cluster-head). Thus 
cost of routing data from raw-node n is given by g(n, p(n)) 
where p(n) is the nearest aggregating node (i.e. cluster-head) 
for node n. The weight function for raw-node n in (4) for the 
cluster based scheme is w(n)=g(n, p(n))+c(g(p(n))-g(n)). 
Since for cluster based methods raw-nodes are situated very 
close to the cluster-heads therefore g(n)-g(p(n)) is small 411 
(since cost of routing data from node n to the sink is almost 
equal to cost of routing data from cluster-head to the sink). 
Further constant ce(0, 1] is very small for a highly correlated 
network. Therefore the cost c(g(n)-g(p(n)))«g(n, p(n)) and 
can be neglected in the weight function. Thus the weight 
function in the cluster-based schemes is w(n)=°g(n, p(n)) and 
for these schemes Equation (4) becomes: 


network nodes in the experimental section. The assignment 
optimization for cluster-based schemes can be developed 
along similar lines as the techniques presented here, and 
remains a topic for future work. 

2.2 Joint Optimization 

In a joint optimization for a routing-driven scheme, adjust- 
ing a RANA and a radio range R„ of each node n in I to 
minimize the total transmission cost. Given such a RANA, a 
WSN can operate by first having each raw node n broadcast its 
data over a radio range R„. This is followed by aggregating 
nodes processing their data based on what they have received 
from neighboring raw nodes and forwarding it to the sink 
along the routing tree T. Therefore similar to section 2.1.2, the 
processing parent p(n)=N+l and the cost function ;f(n, R,„ 
p(n)) is now =;f(n, R„) as defined in (9). 


f(n.R„) = R;,+ min g(s) 

seNfi n (n) 


This is because raw nodes first broadcast their data over a 
radio range R H and the data is then forwarded to the sink 
through nearest (in terms of cost) neighbor in N^ (n). By 
applying these simplifications, the cost function in (3) 
changes to form given in (10). 


C(A, D ) = B,^ s(n . p(n)) + cB^gW- 


As before, c and B ; . are constants and for fixed graph case, 
c2, ie/ B,.g(n) is also constant. Therefore, the only thing left to 55 
optimize in (8) is 2„ I<LD g(n, pn)), which is the cost of routing 
data from raw-nodes to the cluster-heads. Note that the con- 
straint that raw-nodes form a set-cover (i.e, U m<LD N [m] =I) still 
needs to be satisfied and therefore optimizing the cost func- 
tion in (8) is equivalent to solving an MWSC problem stated 60 
in Definition 2. The solution of MWSC for cluster-based 
schemes leads to RANA, which minimizes the routing cost 
from raw nodes to their cluster-heads. Thus for a fixed cluster 
case (i.e. number of aggregating nodes fixed, say k), the 
problem is clustering the nodes into k clusters and can be 65 
solved by a simple k-means clustering algorithm. We show 
results of applying k-means clustering to obtain RANA on the 


C(D, A,R) = Yj R m ) + ^ c n B r g(n) 


Thus total cost is a function of both raw node set D and their 
chosen radio range set R. 

However optimizing the cost function in (10) is NP-hard 
(since the problem is a minimum cost set-cover problem), and 
its complexity increases exponentially with the size of the 
network. In order to simplify it, it can be observed that there 
exist only a finite number of radio range choices Re[R m! ”, 
R"“] for raw-nodes, which can lead to an optimal solution. 
Further, it can be observed that in (1 0) the rate-reduction ratio 
c„ is unknown at each aggregating node n. The value of c„ 
depends on which raw -nodes can broadcast their data to node 
n and what is the correlation between those raw-nodes’ data 
with that of node n. Thus node n in general has a combinato- 
rial number of possible rate-reduction ratios c„. However we 
can reduce the size of this set by IN^™*”! approximate values 
of c„. This is explained in Section 2.2.2. Thus with finitely 
many choices of radio ranges and bit-reduction ratios the 
optimization of cost function in (10), reduces to a Restricted 
Minimum Weighted Set Cover (RMWSC) problem. 

2.2.1 Quantized Radio Ranges Construction 

FIG. 4 shows an example of a quantized radii construction. 
In this figure, a graph 405 shows a raw-node (labeled with n) 
and other nodes (labeled with m). The raw-node n can cover 
a„=3 other nodes within R maa: radius. The size of the neigh- 
borhood N^(n) only increases corresponding to radii {R,, 1 , 
R„ 2 , R ma J- In this example, there are 

a «HN R "‘“"l efficient choices of radio ranges for node n. The 
minimum cost solution for (10) can be found by searching a 
set of set-cover solutions of size equal to product of the 
candidate choices for each node which is bounded by 
O((0d ma J iV ) for some constant 0<1 (for d max , maximum out- 
degree of the network with maximum allowed radio range 

R,™)- 

2.2.2 Rate Reduction Ratio Estimation 

The value of c„ at an aggregating node depends on its 
neighboring raw-nodes, the type of coding that it employs to 
compress its data and the type of encoding scheme used in the 
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network. To estimate c„ at each node n we have two choices. 
First we can estimate a model of c n in the network. For 
example if the nodes have high correlation in their data, then 
any aggregation strategy would reduce number of bits 
required for aggregated data (hence small c n ). Therefore, we 5 
can model c„ inversely proportional to correlation between 
nodes’ data which, for most physical phenomena, can be 
assumed to be a decaying function (denoted R„(d)) of dis- 
tance. Flowever, this kind of modeling provides very conser- 
vative estimates of c„, which results in very loose estimate of to 
cost function. Second we can compute the exact value of c„, 
for each possible set of raw neighbors by computing and 
encoding the transform coefficients at node n with some 
training data. 

In second case this would mean computing roughly 15 


o( 2 Nc 2) 
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2. One disk per node: This means each node n can choose 
only one of these 2a, ; disks available to it. 

2a„ (12) 

YjXn(k) = l 

k = 1 

where %„ is the indicator function. 

3. Coupling restriction: The coupling constraint at node n 
forces the k' ; ' neighbor m„ k of node n to be a raw -node with a 
radius R„ >dist(n, m„*) if node n chooses to be an aggregating 
node with rate-reduction ratio c t k , i.e. 

<•„ \ (13) 

3f„(A + o'„)-^] / i m j(/) . ,0 

<><o " i 


rate-reduction ratio values per node, which is combinatorial 
and therefore intractable. We therefore propose to compute 
only pairwise rate-reduction ratio between neighboring 
nodes. Thus for each neighbor m k , ke{l, 2, . . . a,,} of noden 
we can compute a Q.„ (m k )=c„ k as the worst case c„ if node n 
can only listen to the raw-data of its neighbor n\ k . These 
estimates of c„ are more realistic as they consider enumera- 
tions of all singleton combinations of raw nodes and in a 
minimal set cover of the network, most of the nodes are 
covered by single neighbors. In general the aggregating node 
n will observe a rate reduction ratio c„sc„ k if node m k is 
selected as an raw node with radio range at least dist(n, m k ) 
(the distance between n and m^.). 

2.2.3 Weighted Set Cover Formulation 
In some implementations, a node n can choose to be a raw 
node with exactly a H =IN R ““(n)l choices of radio ranges or can 
be an aggregating node with a n choices for rate-reduction 
ratios c„. These choices can be interpreted as 2a„ weighted 
disks (e.g., sets of vertices) centered at each node n. Further a 
disk can be referred to as a r-disk if it corresponds to choosing 
node n to be a raw-data node, otherwise, a disk can be referred 
to as an a-disk. The k r,! r-disk on a node n has a non-zero radius 
R„ a and covers all nodes in the closed neighborhood N [fi tjln). 
An a-disk has radius=0 and has empty neighborhood (i.e it 
does not cover any vertex of the network). In some imple- 
mentations, first a n disks at each node n are r-disks and 
remaining a rI disks are a-disks. The weight of the disks is 

(B r f(n, «£) if 1. ,a n (11) 

w n,k ~ \ 

\ c k-a n Brg( n ) otherwise 

where w >; k is the weight of k rfi disk at node n and is equal to 
the cost of routing data from node n to the sink, if k rf ' disk is 
selected. Lets consider a sub-collection C of these disks 
which satisfies following restrictions: 

1. Set Cover: The union of selected disks C should cover 
entire graph (all vertices). Since the disks corresponding to 
the nodes selected as aggregating nodes are empty, the union 
of neighborhoods of raw-nodes should form a set cover (i.e., 


U %■] = ' 

m<=D , 


wherekejl, . . ,a„}, and 1 0 is the index of first disk at node m n k 
that covers node n. 

Thus the goal of optimization is to choose such a sub- 
collection C with minimum total weight. This can be defined 
as a Restricted Minimum Weighted Set Cover (RMWSC) 
problem: 

Definition 3 Restricted Minimum Weight Set Cover (RM- 
WSC) Problem: Given a set of nodes vel, let W={ {w„ ^kej 1 , 
2. . . , 2a„]}:ne{l,2 . . . N}} be the collection of set of wei gilts 
for the disks at each node as defined above. A restricted 
set-cover C is a collection of weighted disks that forms a set 
cover and satisfies the restrictions given in (12) and 13. The 
RMWSC problem is to find a restricted set -cover with mini- 
mum total Cost. 


N (14) 

CoSt = Yj W ^Xn(k) 

n=l k = l 

This problem of finding a set-cover is NP-hard in general. 
A technique can find an approximate solution to RMWSC 
problem, such a technique can include centralized algorithm 
or a distributed algorithm. Distributed algorithms can be 
robust to network topology changes and failures since the 
nodes readjust their parameters by local communications to 
find a new set-cover. 

2.2.4 Linear Programming Optimization 

The LP-formulation for MWSC problems is already well- 
known. Here we extend these ideas to our RMWSC problem. 
For this we denote by L c =22^ / a v , the total number of pos- 
sible disks. We define a cost vector c as c=[c 1; c 2 , . . . , c^] 
where c„ is the cost function vector for node n i.e. c„=[w„ 15 
w » 2 > • • • w , 2 al- Thus c is a L c xl size vector containing 
weights of all possible disks at all nodes. Note that this way 
we implicitly number the disks from 1 to L c such that the 
weight of q rf ' disk is c(q). The solution vector x 0 is a binary 
vector of size L c xl such that x 0 (q)=l implies that disk q is 
selected. For the set-cover constraints, we define a set-cover 
matrix A of size NxL c such that A(n, q)=l if node n is covered 
by disk q. Further for the one disk per node constraints, we 
define matrix A eq of size NxL c such that A e? (n, q)=l for all 
disks q centered at node n. 

Finally for the coupling constraints we define another 
matrix B of size L c /2xL c such that each row of B is the 
coupling constraint of (13) on an a-disk of a node (hence the 
row size of matrix BM^^/2). Thus B(q 1: q^l and if disk q : 
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corresponds to k rf ' a-disk of a node n then B(q 1 , q 2 )=-l for 
indices q 2 corresponding to all r-disks of node (i.e., k' ; ' 
nearest neighbor of node n) that cover node n. The Integer- 
Programming (IP) formulation of RMWSC problem can now 
be defined as in Formulation 1 . 

Formulation 1 (RMWSC Problem) Find a solution vector 
x 0 which minimizes 


c' ■ Jto = y ; c(s)-*o(s.) 


subject to the following constraints A-XqM^ (set cover), 
A e •x 0 =l iv (one disk per node), B-x o s0 (coupling constraint), 
and x o (s)e{0,l} (integerality constraint). 

Note that every solution of this problem is a feasible 
restricted set-cover and every restricted set-cover on I is a 
feasible solution of this problem. Therefore the optimal value 
of this problem is equal to RMWSC. Since IP formulation is 
NP-Hard, we find LP relaxation solution, which has same 
formulation as in Formulation 1 except that solution vector x 
is now real and xefOlU. This gives us an linear optimal 
solution vector x*e[01] iv . So we need a way to round this 
fractional solution to an integral one. Our idea of rounding is 
to pick that disk at each node for which the corresponding 
entry in solution vector x is maximum among all disks on that 
node. If there is a tie we pick the disk with larger radius to 
maintain set-cover constraint. Note that although LP-relax- 
ation solution vector x* is guaranteed to be a restricted set- 
cover, the rounded vector, x 0 may not provide a set-cover. It 
does However, select one disk per node and hence satisfies 
those constraints. In this case, for every node n which is not 
covered by integer solution x 0 (i.e. Ax 0 =0), we find the set of 
all raw-data neighbors in the R" 1 ^ radius of node n and choose 
the neighbor which can cover the node n by increasing its 
radio range by minimum increase in cost. This gives us a 
centralized set-cover solution. 

2.3 Distributed Set Cover Algorithms 

Distributed Set Cover Algorithms can reduce the number 
of required communications between nodes and dynamically 
adjust a node’s radio range. Such algorithms can use distrib- 
uted heuristics for computing the raw-aggregating node 
assignments based on information gathered locally by every 
node in its 1-hop neighborhood. A distributed set covering 
algorithm is described below which is constructed from a 
fixed set of radio ranges, using only information gathered 
locally in the 1-hop neighbor. Also described below are dis- 
tributed modification algorithms which reduce the size of 
these set coverings by allowing each node to increase its radio 
range. 

2.3.1 Distributed Heuristic for Minimum Set Cover 

RANA optimization technique can include assigning as 
raw data nodes the ones with greater outdegree in the network 
and forcing their neighbors to become aggregating nodes. 
This can be performed in a distributed way by allowing each 
node to exchange a few messages with its neighbors and then 
decides between becoming a raw or aggregating node. The 
technique can include determining a number of neighbors at 
each node, exchanging outdegree information, and sequen- 
tially performing one or more raw-aggregating assignments. 

FIG. 5 shows an example of an algorithm based on a 
distributed heuristic for a minimum set cover. The algorithm 
of FIG. 5 shows the operations of such an algorithm for a 
single node. In a WSN, multiple nodes will be running this 
algorithm to determine their node types in a distributed and 
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cooperative fashion. After scheduling transmissions between 
adjacent sensors to avoid interference, each node broadcasts 
a pilot signal which includes its transmit power level. After 
listening for pilots, each node broadcasts an acknowledgment 
5 packet at the maximum radio range of the pilots it has 
received. The outdegree from each node is broadcast to its 
neighbors . The outdegree of a node is the number of acknowl- 
edgment it receives. For example, based on a node receiving 
3 acknowledgments, the node’s outdegree is 3. The nodes 
to decide between becoming a raw or aggregating node. In this 
deciding process, the nodes rim random timeouts during 
which they listen to their neighbors becoming raw -nodes. If a 
node becomes a raw-node, the node broadcasts information 
including its label (such as a node identifier or a MAC 
15 address) and a type status (status indicating that “I am a raw 
node” or “I am an aggregation node”). If one node hears a 
neighbor declaring that it has become a raw node, it assigns 
itself as aggregating and broadcasts its label. Otherwise, once 
the timeout is over, the node compares its outdegree with that 
20 of its neighbors. If its out -degree is greater than the out-degree 
of its unassigned neighbors, then node declares itself raw- 
node and transmit its label. Otherwise it restarts another time- 
out and listens to one of its unassigned neighbors turning into 
a raw node. 

25 2.3.2 Distributed Set Cover Modifications 

FIGS. 6A, 6B, 6C, and 6D shows examples of distributed 
set-cover modifications. Based on allowing one or more 
nodes to increase their radio ranges, an assignment given by 
the algorithm of FIG. 5 can be improved. In these figures, blue 
30 squares are raw nodes, red circles are aggregating nodes. An 
arrow from square to circle indicates that the raw-node 
(square) covers the aggregating node (circle). If in FIG. 6A, 
node n increases its radio range from R,/ to R„ 2 , it will be able 
to cover more nodes as shown in FIG. 6B. If in FIG. 6C,node 
35 n switches from aggregating to raw node and increases its 
radio range, then it is able to reduce the size of the set cover as 
shown in 6D. 

In FIG. 6A, the radio range of raw node n is changed from 
R,, 1 to R,, 2 since by doing so it can cover raw-nodes k and m, 
40 both of which in-tum become aggregating (see FIG. 6B). 
There are no raw-nodes created as nodes k and m were only 
connected to aggregating nodes which were either already 
covered by node n with radio range R,, 2 or are being covered 
by other raw-nodes in the network. 

45 In FIG. 6C, aggregating node n is changed to be a raw -node 

since by doing so, it now covers 3 raw -nodes n l5 n 2 and n 3 , 
which then become aggregating (FIG. 6D). Again we make 
sure that doing this does not create extra raw-nodes since the 
aggregating nodes which are neighbors of n 15 n 2 and n 3 are 
50 already covered by other raw -nodes. Below we describe how 
to make these decisions by considering a local cost heuristics. 

Assuming that each node m operates in a radio range within 
the interval [R m "”", R ma J, it may be worth increasing the 
initial R m """ of some sensors to cover other raw nodes. This 
55 will allow those raw nodes to become aggregating nodes, and 
will reduce the amount of uncompressed data that is trans- 
mitted to the sink. Even though the increase in radio range 
will lead to an increase in communication cost, this can be 
compensated with the compression of data in the newly 
60 assigned aggregating nodes. 

For each raw node m let !o. denote the set of raw nodes 
that can overhear in’s data when node m increases its trans- 
mission radio range from R,„ to ~km . In this case node m is 
65 able to cover some other raw data nodes, and it may be 
worthwhile to flip these raw nodes to aggregating nodes while 
increasing node m’ s radio range. Assuming the same notation 
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as in Sec. 2.2, the cost of routing data towards the sink from 
both m and B* will switch from (1 5) to (16). 


Cl =B r (^+g(p(m)))+ Y, B rSW 
>«% m 


( 15 ) 5 


C 2 =B^+g(jS(m)))+ Y 
nS m 


c„B r g(n) 


( 16 ) 

10 


On the other hand, it may be also worth to increase the radio 
range of an aggregating node m, to switch its assignment to 
raw node, and to allow it to cover some present raw data nodes 1 5 

intheset B* .Inthiscase, thereisareversalofroles between 

node m and nodes in B* . Now, the overall cost of routing 
data towards the sink from all the involved nodes will change 
from (17) to (18). 

Ci=c m B r (Rl +g(p(m)))+ Y 
n( $> m 

C 4 +g^(m))]+ 2 c„B r g(n) 

nS 

m 

For gaining additional savings in terms of cost, (15) (resp. 
(17)) must be greater than (16) (resp. (18)). In other words, 

each node needs to find the radio range to* , among its quan- 
tized values as defined in Sec. 2.2, that maximizes the differ- 35 

enceS^j-C, (resp. S=C 3 -C 4 ). By choosing to* inthisway, 
the total cost will always go down. Thus, these heuristics will 
converge to a local minimum. 

We have observed that for uniformly deployed networks is 
better to first let the current raw data nodes check if by 
changing their radio ranges they can improve the present set 
cover, and then do the same with the aggregating nodes. On 
the contrary, for clusterized networks is better to do it in the 
reverse order. However, in both ways, for implementing this ^ 
modification as a distributed algorithm, first of all nodes will 
need to get or approximate the c„ values. The routing cost 

g(m) and g(n) for each ne B m is be exchanged. To avoid this 
cost for learning and exchange, a WSN can use approxima- 
tions to these values by using information available at node m. 50 
In addition to the cost approximations, after each change in 
raw-aggregating assignment we need to check that a set cover 
still exists. Note that these changes in assignment are done 
one node at a time, so it is only necessary to check for a set 
cover within two hops of each node (i.e., checking is local- 55 
ized). In the first case, when a raw node is flipped to become 
an aggregating node, we need to check that the neighbors of 
any raw node that has been flipped is still covered. In the 
second case when an aggregating node is flipped to be raw and 
some of its raw neighbors are flipped to be aggregating, the 60 
neighbors of any raw node that is flipped must still be covered. 
2.3.3 Approximation-Based Modification Techniques 
Approximation-based modification techniques can opti- 
mize a RANA. Such techniques can use approximations to the 
cost equations ( 1 5)-(l 8) that allow nodes to improve the 65 
original set cover resulting from Section 2.3.1 by using only 
information that they have locally available. In some imple- 


( 17 ) 
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mentations, nodes can assume that the rate reduction ratio c, 
is a constant value c between (0, 1) for all neighbors (for 
example, c=0.5). In some cases, the difference in terms for 
routing cost between the current parent in the tree and the one 
after the increase in radio range is negligible, e.g., 

g(P( m ))”g( jp (m). 

Note that in fairly dense networks, node m will often be 
located near the center of its neighbors in B* . Titus, the 
average routing cost of nodes in B* will be similar to the 
cost at node m, i.e., g(m)=X,*w„ g(n)/lB* I. Thus, 2 

B,.g(n)«B,.l&* lg(m) and c„B,.g(n)=cB,.l 

B* lg(m). Thus, 

&1 =B r (R m 2 +g(p(m)))+B r \&™ lg(m) (19) 

=B r $tm +g(p(m)))+cB r fe™ lg(m) (20) 

d?=c«*R,„ 2 +g(p(/H)))+« r !B» lg(m) (21) 

Ui =5 r (to* +g(p(w)))+ci? r |Bw | g(m) (22) 

Based on taking these approximations into consideration, 
and supposing the fact that all nodes already know their 
quantized radio range values, the distributed heuristics for 
improving the minimum set cover for both raw and aggregat- 
ing nodes are given in the algorithms depicted by FIG. 7 and 
FIG. 8. FIG. 7 shows an example of an approximation-based 
distributed minimum set cover modification algorithm for 
raw data nodes. FIG. 8 shows an example of an approxima- 
tion-based distributed minimum set cover modification algo- 
rithm for aggregating nodes. Note that each node m only uses 
information that it has available to decide its radio range, e.g., 
(1 9)-(22) are only functions of information available at node 
m. While this relies on some approximations, it significantly 
reduces the amount of coordination needed to implement 
these modifications in a distributed manner. All that needs to 
be checked is if a set cover still exists. 

In worst case scenario the number rounds of communica- 
tions needed to execute algorithms of FIGS. 7 and 8 in the 
network is 0(d max ) per node, where d max is the maximum 
out degree of the nodes . This can be observed since in order to 
change its parity each node needs to transmit with at most 
d mo* radio ranges. For bounded degree networks the order of 
communications complexity is therefore 0(N). 

The algorithms of FIG. 5, FIG. 7, and FIG. 8 can be 
executed one after another. This is implemented using 
another set of timeouts in network and in each timeout period 
nodes execute an assigned distributed task. By doing the 
algorithms described above, nodes can verify if it is worth to 
increase its radio range just by knowing how many raw data 
nodes can they cover. These distributed heuristics still reduce 
the number of raw data nodes in the network while they are 
suitable in a practical setting due to the low number of addi- 
tional communications needed. 

2.4 Simulations 

In thi s section, the performance of various distributed MSC 
techniques are compared against against a centralized assign- 
ment algorithm based on unidirectional graph-based trans- 
forms as described herein, a Haar-like tree-based transform, 
and a cluster-based KLT. For performance evaluation, we use 
simulated data to generate using a second order AR model. 
This data is (multi-variate) Gaussian distributed. The nodes 
were placed in a 600x600 grid. Hie data value at each node 
corresponds to the value from the associated position in the 
grid. A shortest -path routing tree (SPT) which minimizes the 
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sum of squared distances to the sink node is used for routing. 
We measure energy consumption using the cost model for 
WSN devices where the energy consumed in transmitting k 
bits over a distance D is Ej(k, D)=E eto _k+e am yk-D 2 Joules 
and the energy consumed in receiving k bits is E/((k)=E efec -k 
Joules. The energy dissipated by radio electronics to process 
k bits is given by E elec -k. The routing-driven compression 
methods used for these simulations use a lifting transform, 
such as those described herein. A lifting transform can com- 
pute prediction residuals for each aggregating node using the 
data received from raw node neighbors, and the prediction 
filters are adapted over time. 

For these simulations, transform coefficients are quantized 
using a dead- zone uniform scalar quantizer and are encoded 
using an adaptive arithmetic coder. Performance in all cases is 
measured by the trade-off between the total energy consump- 
tion at each quantization level and reconstruction quality 
measured by the signal-to-quantization-noise ratio (SNR) 
expressed in dB. Higher SNR (for a fixed cost) implies higher 
fidelity reconstruction of the original data, and a difference of 
1 dB in SNR translates to a decrease by a factor of 1 0 in MSE. 

For the cluster-based KLT, the network can be divided into 
clusters using k — means clustering. This choice of clustering 
is based on the analysis in Section 2.1.3 which shows that, 
under the approximations and assumptions given there, 
k — means clustering is a good algorithm for finding an 
assignment that minimizes the total cost. Once the clusters 
have been defined, we compute the KLT in each cluster using 
the normalized linear mean square (NLMS) filters. We use a 
KLT since it is the optimal (orthogonal) transform for repre- 
senting stationary, Gaussian data. Note that each KLT is esti- 
mated using actual data, and in a real system implementation 
those transform matrices would need to be forwarded to the 
sink so that each KLT can be inverted. We choose to ignore the 
cost for transmitting those matrices for the sake of simplicity. 
For each cluster, we choose the cluster-head as the node 
closest to the centroid of the cluster, form an SPT rooted at 
each cluster head, and route data from raw nodes to each 
cluster-head along each SPT. Finally, once cluster-heads 
gather data from the raw nodes in their cluster, they compute 
a KLT, encode the transform coefficients, and transmit them 
to the sink along an SPT. 

FIGS. 9A, 9B, 9C, and 9D shows different example con- 
figurations of a wireless sensor network based on different 
optimization techniques. These figures show minimum set 
covers of a 70 node network. FIG. 9A shows a configuration 
for a distributed minimum set cover. FIG. 9B shows a con- 
figuration for a modified distributed minimum set cover with 
exact values for c M and g(n). FIG. 9C shows a configuration 
for a approximation based modified distributed minimum set 
cover with approximate values for c„ and g(n). FIG. 9D shows 
a configuration for a LP-optimized set cover. 

FIG. 10 shows examples of raw-costs and total costs for 
different set covering algorithms. A graph 1005 shows the 
raw-data cost and total cost for various techniques: cluster 
based, Mistributed Min Cover, Centralized Min cover, Distr- 
buted Modification (Approx), Distributed modification, and 
LP-optimized). It can be seen the graph 1005 , that except 
clustering based method the total cost in the network is higher 
for an algorithm if its raw data cost is higher, with the LP- 
optimized solution providing lowest raw-data cost and hence 
total cost. For clustering based method cluster-heads send 
compressed data for all other nodes in their clusters to the 
sink, while in other methods raw-nodes send their raw-data 
all the way to the sink. So for clustering methods even though 
the raw-cost is high, the network has lower over-all cost than 
routing-driven methods. 
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FIG. 11 shows examples of cost-distortion curves for dif- 
ferent set covering algorithms. The cost versus distortion 
curves are shown in FIG. 11 . The cost-distortion curve forthe 
centralized set cover, which is based on unidirectional graph- 
5 based transforms as described herein, is shown (see central- 
ized Min Cover) as a point of reference, as is a cluster-based 
KLT. The cluster-based KLT gives better performance than 
simple raw data transmission, but still performs worse than 
the routing-driven compression schemes. This is mainly 
to because (i) this cluster-based scheme lias many raw data 
nodes, and (ii) there are many data transmissions that are 
made away from the sink. In this network, the distributed set 
cover algorithm gives similar cost-distortion performance as 
the centralized set cover. The LP-optimized set cover does the 
15 best overall, as is expected. The exact and approximate modi- 
fications are second best and are nearly identical. More 
importantly, they significantly improve on the performance of 
the distributed set cover, with more than 5 dB increase in SNR 
for a fixed cost. 

20 3 Unidirectional Graph-Based Transforms 

A WSN can use a lifting-based distributed wavelet trans- 
form for any arbitrary communication graph. Moreover, Uni- 
directional transforms can be computed as data are forwarded 
toward the sink on a routing tree. Transmitting raw data bits 
25 along the routing trees in a WSN typically requires more bits 
than transmitting encoded data. Thus, it is advantageous to 
minimize raw data transmissions in the network. Lifting 
based distributed wavelet transforms are useful since they are 
invertible as long as, given an arbitrary even and odd splitting, 
30 data from odd nodes is only processed using data from even 
nodes and vice versa. Moreover, even nodes play the role of 
raw data nodes and odd nodes play the role of aggregating 
nodes. Therefore, a transform design can seek to find invert- 
ible, energy-efficient lifting transforms by finding even and 
35 odd splittings which minimize the number of even (i.e., raw 
data) nodes. Such designs will implicitly minimize the num- 
ber of nodes which must transmit raw data, therefore leading 
to transforms which are most energy-efficient overall. Such 
designs construct an even/odd split of nodes which 1) mini- 
40 mizes the number of even nodes while ensuring that at least 
one even node is in the vicinity of each odd node and 2) 
minimizes the cost of raw data transmissions per even node. A 
WSN can use a transmission scheduling for sensor nodes that 
make transform computations unidirectional. 

45 General construction of lifting wavelet transforms on net- 
work graphs, and how to make these transforms unidirec- 
tional is now described. Let G=(V, E) be a directed commu- 
nication graph of a WSN with N nodes indexed by nel={l, 
2, . . . , N}, with the sink node having index N+l and where 
50 each edge (m, n)eE denotes a communication link from node 
m to node n. Let T=(V, E r ) be a routing tree in G along which 
data, denoted by x(n), flows towards the sink. Let depth(n) be 
the number of hops from n to the sink on T and let p„ denote 
the parent of n, C n the set of children of n and D ;I the descen- 
55 dants of n in T. Also let A„ denote the set of nodes that n routes 
data through to the sink excluding the sink, i.e., ancestors of 
n. Finally, let E and O denote some arbitrary set of even and 
odd nodes respectively. 

Lifting based wavelet transforms can use a set of condi- 
60 tions for a distributed lifting transform to be unidirectional 
(e.g., the transform can be computed along T as data is routed 
towards the sink). Given a transmission schedule which 
assigns a time slot to each node to transmit its own data along 
routing tree T, a transform has unidirectional operation if each 
65 node n can compute its coefficients using only data received 
from the nodes which transmit before node n (data from 
descendants D„ and broadcast neighbors B„), and n does not 
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forward data from its broadcast neighbors, i.e., meB„-D„. In 
case of lifting this means that the transform is unidirectional 
as long as each even node transmits its data before all of the 
odd nodes which are connected to it. 

In some WSNs, all even nodes transmit their data first 
according to their depth in the routing tree, followed by odd 
nodes according to their depth in the tree. This way odd nodes 
gather all the data they need to compute their predictions 
before transmitting their own data. This can increase the 
network delays as the nodes may have to transmit their own 
data and data from their ancestors in different time slots. 
However cost-wise it would make no difference since each 
node is transmitting its own data only once. Linear prediction 
operators p„ and update operators u m at nodes neO and meE, 
respectively can be designed in a variety of ways. A W SN can 
use NLMS filters which adapt the spatial prediction filters 
over time. NLMS filters converge to optimal filters over time 
by gradually minimizing prediction errors. Given the even/ 
odd split in the graph, let N„ denote the set of transform 
neighbors for all nodes n, where N M <=0 for neE (i.e., even 
nodes only have odd neighbors) and N m <= E for meO (i.e., odd 
nodes only have even neighbors). Then for each meO, detail 
coefficient d(m) can be computed as: 

d(m) =x(m)~ ^ p m (k)x(k) (23) 

keN m 

Note that if the prediction 2 i€A ^p m (k)x(k) is close to x(m), 
then d(m) will have small magnitude and so can be encoded 
using fewer bits than that those that would be needed for raw 
data x(m). This will ultimately lead to cost reduction for odd 
nodes since they transmit fewer bits per coefficient. An update 
step can also be computed for data from each even node to 
produce smooth coefficients, but the number of bits needed to 
encode smooth coefficients is typically the same as the num- 
ber of bits needed for raw data. Therefore, in this work even 
nodes do not compute any update coefficients. 

In some implementations, the splitting is based on depth 
(e.g., odd depthnodes are odd, even depth nodes are even). In 
this case, all even nodes forward raw data one hop to their odd 
parents, odd parents compute predictions, then odd parents 
update data from their even children. Since splitting is based 
on depth, roughly half of the nodes will be even, thus, roughly 
half of the nodes transmit raw data. This large number (50%) 
or raw data nodes is due to the splitting on the tree, which has 
roughly half even depth nodes. 

Node partitions (e.g., even or odd) can be determined 
which minimize cost of transmissions. In case of the uni- 
directional transform described above this problem boils 
down to minimizing the number of even nodes in the network. 
In addition, order to reduce the energy of prediction residues 
(which leads to lower bit-rate) we want each odd node in the 
network to have at least one even node in its neighborhood to 
compute its detail coefficients. Given the radio ranges of the 
nodes, this becomes a set covering problem on directed com- 
munication graph. The radio ranges affect the number of 
neighboring nodes, and thus size of the set-cover. As a first 
stage we fix the radio range of each node to the minimum 
value that guarantees that a node can transmit data to its 
parent in the routing tree. 

Set Covering Problem: For Graph G=(V, E) denote closed 
neighborhood n [v] =n [v] ={v}U{ueV:vueE} for all nodes veV. 
Given a collection N of all sets {n [v] }, a set-cover C^N is a 
sub collection of the sets whose union is V. The set-covering 
problem is, given N, to find a minimum-cardinality set cover. 


24 

In our case the neighborhood n M for node v is set of all 
nodes within the radio range of node v. Once we obtain a 
set-cover C={n [v . ] } 76l _ 2j p , we denote set { I v y | } /frl 2> p as 

even nodes and remaining nodes as odd nodes. The problem 
5 in this form is also referred to as dominating set problem. 
Set-covering problem for unweighted undirected graphs is 
NP-hard in general. However it can be solved by a natural 
greedy algorithm that iteratively adds a set that covers the 
highest number of yet uncovered elements. It provides a good 
to approximation and can be implemented in a distributed way. 
The algorithm is same for directed graphs with the exception 
that sets with highest outdegree of central node are added first 
to the cover. FIG. 12 shows an example of an algorithm for 
choosing a greedy set cover in an unweighted directed graph. 
15 FIG. 13 shows an example of an algorithm for choosing a 
greedy set cover in a weighted vertex directed graph. In the 
weighted set-covering problem, for each set n M eN a weight 
w,>0 is also specified, and the goal is to find a set cover C of 
minimum total weight. In the context of our problem weight 
20 w,, for node v is the total cost of transmitting raw data from 
node v to the sink along the routing path. The greedy algo- 
rithm for weighted set cover builds a cover by repeatedly 
choosing a a set n M eN that minimizes the weight w v divided 
by number of elements in n M not yet covered by chosen sets. 
25 FIG. 14 shows an example of cost comparisons of different 
lifting schemes. A transform with a Haar-like tree-based split 
(Transform A) and an extension of this transfonn where odd 
nodes perform additional levels of decomposition on data 
received from their even children (Transform B) are com- 
30 pared with the unidirectional transforms with graph-based 
splits described herein. The described transforms include 
unweighted set cover based even/odd split on graph-based 
splits (Transform C) and weighted set cover based even/odd 
split on graph-based splits (Transfonn D). The transforms use 
35 a data adaptive prediction filter design. In this graph 1405, the 
raw data cost and a total cost is shown for each transform. The 
cost of raw data transmissions for the weighted set cover 
based split (Transform D) is lower than the cost of raw data 
transmissions for unweighted set cover based split (Trans- 
40 form C). This is to be expected since even nodes selected by 
weighted set cover algorithms now have lower costs of trans- 
mitting data to the sink. 

FIG. 15 shows an example of raw data transmissions 
required by different lifting schemes. The graph 1505 shows, 
45 for networks of different sizes, the number of raw data trans- 
missions taking place for a transform with a Haar-like tree- 
based split and the number of raw data transmissions taking 
place for a unidirectional transform with graph-based splits. 
As shown by the graph, graph-based splits leads to a signifi- 
50 cant reduction in raw data transmissions. Assuming a nearly 
uniform deployment of sensors, the distances between nodes 
are roughly equal. Hence reduction in the number of raw 
transmissions is directly proportional to the reduction in 
transmissions costs as shown in FIG. 14. 

55 FIGS. 16A and 16B show examples of different transfor- 
mation structures. In these figures, circles denote even nodes 
and x’s denote odd nodes, the sink is shown in the center as a 
square, solid lines represent forwarding links, and dashed 
lines denote broadcast links. In these examples, an AR-2 
60 model is used to generate noise-free simulation data with high 
spatial data correlation, e.g., the amount of data correlation 
between two nodes increases as the distance between them 
decreases. We also assume that raw measurements use 12 bits. 
FIG. 16A shows a transformation structure based on a short- 
65 est path routing tree. A randomly generated 50 node network 
and a shortest path routing tree (SPT) is computed and used 
for routing. The SPT is depicted based on an even and odd 
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splitting. FIG. 16B shows a transformation structure based on 
graph-based splits. The graph depicts a transformation struc- 
ture based on graph-based splits. Note that although both 
FIGS. 16A and 16B have the same underlying routing struc- 
ture (solid lines), the number of required even nodes are lesser 
for a graph-based transform than a tree-based transform. This 
leads to reduction in raw-data transmission costs. 

FIG. 17 shows an example comparing energy consumption 
for different lifting schemes. The graph 1705 plots energy 
consumption versus reconstruction quality in terms of Signal 
to Quantization Noise Ratio. 

Each point in the graph 1705 corresponds to a different 
quantization level with adaptive arithmetic coding applied to 
blocks of 50 coefficients at each node. Hie graph 1705 shows 
the energy consumption for a 1 -level tree-based split, a multi- 
level tree-based split, an unweighted graph-based split, a 
weight graph-based split, and the raw data no-transformation 
base case. The two graph-based splits out performed the two 
tree-based splits, and, obviously, all four out performed the 
raw data no -transformation base cases. 

The transforms based on graph-based splits perform better 
because these transformations seeks to minimize the number 
of nodes that transmit raw data to their neighbors, therein 
reducing the total energy consumed in the data gathering 
process. The transforms based on tree-based splits have 
roughly 50% raw data nodes, hence, they are not as efficient 
as the two transforms with graph-based splits (which have 
roughly 25% raw data nodes). 

One or more of the described techniques can be applied to 
any arbitrary W SN, based on it being computed as the data are 
routed toward the sink. The schedule of computation and the 
even-odd assignment of nodes can be pre-fed into sensors at 
initialization. This transform design can be seen as precursor 
to a new class of algorithms which would focus on minimiz- 
ing raw data transmissions in a WSN by jointly optimizing 
routing tree and even/odd partition (or raw nodes/aggregating 
nodes partition). 

The disclosed and other embodiments and the functional 
operations described in this document can be implemented in 
digital electronic circuitry, or in computer software, firm- 
ware, or hardware, including the structures disclosed in this 
document and their structural equivalents, or in combinations 
of one or more of them. The disclosed and other embodiments 
can be implemented as one or more computer program prod- 
ucts, i.e., one or more modules of computer program instruc- 
tions encoded on a computer readable medium for execution 
by, or to control the operation of, data processing apparatus. 
The computer readable medium can be a machine-readable 
storage device, a machine-readable storage substrate, a 
memory device, a composition of matter effecting a machine- 
readable propagated signal, or a combination of one or more 
them. The tenn “data processing apparatus” encompasses all 
apparatus, devices, and machines for processing data, includ- 
ing by way of example a programmable processor, a com- 
puter, or multiple processors or computers. The apparatus can 
include, in addition to hardware, code that creates an execu- 
tion environment for the computer program in question, e.g., 
code that constitutes processor firmware, a protocol stack, a 
database management system, an operating system, or a com- 
bination of one or more of them. A propagated signal is an 
artificially generated signal, e.g., a machine-generated elec- 
trical, optical, or electromagnetic signal, that is generated to 
encode information for transmission to suitable receiver 
apparatus. 

A computer program (also known as a program, software, 
software application, script, or code) can be written in any 
form of progra mmi ng language, including compiled or inter- 
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preted languages, and it can be deployed in any form, includ- 
ing as a stand alone program or as a module, component, 
subroutine, or other unit suitable for use in a computing 
environment. A computer program does not necessarily cor- 
5 respond to a file in a file system. A program can be stored in 
a portion of a file that holds other programs or data (e.g., one 
or more scripts stored in a markup language document), in a 
single file dedicated to the program in question, or in multiple 
coordinated files (e.g., files that store one or more modules, 
to sub programs, or portions of code). A computer program can 
be deployed to be executed on one computer or on multiple 
computers that are located at one site or distributed across 
multiple sites and interconnected by a communication net- 
work. 

15 The processes and logic flows described in this document 
can be performed by one or more programmable processors 
executing one or more computer programs to perform func- 
tions by operating on input data and generating output. The 
processes and logic flows can also be performed by, and 
20 apparatus can also be implemented as, special purpose logic 
circuitry, e.g., an FPGA (field programmable gate array) or an 
ASIC (application specific integrated circuit). 

Processors suitable for the execution of a computer pro- 
gram include, by way of example, both general and special 
25 purpose microprocessors, and any one or more processors of 
any kind of digital computer. Generally, a processor will 
receive instructions and data from a read only memory or a 
random access memory or both. The essential elements of a 
computer are a processor for performing instructions and one 
30 or more memory devices for storing instructions and data. 
Generally, a computer will also include, or be operatively 
coupled to receive data from or transfer data to, or both, one 
or more mass storage devices for storing data, e.g., magnetic, 
magneto optical disks, or optical disks. However, a computer 
35 need not have such devices. Computer readable media suit- 
able for storing computer program instructions and data 
include all forms of non volatile memory, media and memory 
devices, including by way of example semiconductor 
memory devices, e.g., EPROM, EEPROM, and flash memory 
40 devices; magnetic disks, e.g., internal hard disks or remov- 
able disks; magneto optical disks; and CD ROM and DVD- 
ROM disks. The processor and the memory can be supple- 
mented by, or incorporated in, special purpose logic circuitry. 

While this document contains many specifics, these should 
45 not be construed as limitations on the scope of an invention 
that is claimed or of what may be claimed, but rather as 
descriptions of features specific to particular embodiments. 
Certain features that are described in this document in the 
context of separate embodiments can also be implemented in 
50 combination in a single embodiment. Conversely, various 
features that are described in the context of a single embodi- 
ment can also be implemented in multiple embodiments sepa- 
rately or in any suitable sub-combination. Moreover, 
although features may be described above as acting in certain 
55 combinations and even initially claimed as such, one or more 
features from a claimed combination can in some cases be 
excised from the combination, and the claimed combination 
may be directed to a sub-combination or a variation of a 
sub-combination. Similarly, while operations are depicted in 
60 the drawings in a particular order, this should not be under- 
stood as requiring that such operations be performed in the 
particular order shown or in sequential order, or that all illus- 
trated operations be performed, to achieve desirable results. 

Only a few examples and implementations are disclosed. 
65 Variations, modifications, and enhancements to the described 
examples and implementations and other implementations 
can be made based on what is disclosed. 
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What is claimed is: 

1. A method implemented by a local node in a wireless 
sensor network, the method comprising: 

transmitting, from the local node, a first pilot signal using 
a local power level that controls a radio range of signals 5 
transmitted from the local node; 
detecting, at the local node, one or more remote nodes 
included in the wireless sensor network, wherein detect- 
ing the one or more remote nodes comprises receiving 
one or more second pilot signals from the one or more 10 
remote nodes; 

transmitting a local outdegree to the one or more detected 
remote nodes, the local outdegree being based on a 
quantity of the one or more detected remote nodes; 15 
receiving one or more remote outdegrees from the one or 
more detected remote nodes; 
determining a local node type of the local node, wherein 
determining the local node type comprises (i) exchang- 
ing messages with at least a portion of the one or more 20 
detected remote nodes, the messages indicating a node 
type, (ii) selectively assigning the local node type as a 
raw type based on information comprising the local 
outdegree, the one or more remote outdegrees, and one 
or more of the messages, and (iii) selectively assigning 25 
the local node type as an aggregation type based on a 
neighbor node being of the raw type, wherein a node of 
the raw type sends raw data to a node of the aggregation 
type, wherein the node of the aggregation type aggre- 
gates and compresses raw data; and 
adjusting characteristics, including an energy usage char- 
acteristic and a data compression characteristic, of the 
wireless sensor network by (i) selectively modifying the 
local power level and (ii) selectively changing the local , ,- 
node type. 

2. The method of claim 1, wherein the first pilot signal 
comprises an indication of the local power level, wherein the 
one or more received second pilot signals each comprise an 
indication of a power level, and wherein detecting the one or 40 
more remote nodes comprises: 

transmitting an acknowledgement message based on a 
maximum power level indicated by the one or more 
received second pilot signals; and 
receiving one or more acknowledgements from at least a 45 
portion of the one or more remote nodes. 

3. The method of claim 2, wherein the local node type is the 
raw type, and wherein adjusting the characteristics com- 
prises: 

increasing the local power level to increase the radio range 50 
to reach one or more additional remote nodes; and 
transmitting an additional signal at the increased local 
power level to cause at least a portion of the one or more 
additional remote nodes to become the aggregation type. 

4. The method of claim 1, wherein adjusting the character- 55 
istics comprises performing an approximation-based distrib- 
uted minimum set cover modification algorithm to change 
one or more node type assignments within the wireless sensor 
network to increase a data compression ratio. 

5. The method of claim 4, wherein performing the approxi- 60 
mation-based distributed minimum set cover modification 
algorithm comprises increasing the local power level to 
increase the radio range to reach one or more additional 
remote nodes. 

6. The method of claim 1, wherein the node of the aggre- 65 
gation type is configured to perform a portion of a distributed 
wavelet transform to generate compressed data. 
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7. An apparatus comprising: 

transceiver electronics to co mm unicate with a wireless 
sensor network; and 

processor electronics configured to (i) control the trans- 
ceiver electronics to transmit a first pilot signal using a 
local power level that controls a radio range of signals 
transmitted via the transceiver electronics, (ii) detect one 
or more nodes of the wireless sensor network based on a 
reception, via the transceiver electronics, of one or more 
second pilot signals from the one or more nodes of the 
wireless sensor network, (iii) control the transceiver 
electronics to transmit a local outdegree to the one or 
more detected nodes, the local outdegree being based on 
a quantity of detected nodes, (iv) receive, via the trans- 
ceiver electronics, one or more remote outdegrees from 
the one or more detected nodes, (v) determine a local 
node type of the apparatus based on the local outdegree, 
the one or more remote outdegrees, and one or more 
detected node types corresponding to at least a portion of 
the one or more detected nodes, respectively, and (vi) 
adjust characteristics, including an energy usage char- 
acteristic and a data compression characteristic, of the 
network by selectively modifying the local power level 
and selectively changing the local node type. 

8. The apparatus of claim 7, wherein the processor elec- 
tronics are configured to (i) exchange messages with at least 
a portion of the detected nodes, the messages indicating a 
node type, (ii) selectively assign the local node type as a raw 
type based on information comprising the local outdegree, the 
one or more remote outdegrees, and one or more of the mes- 
sages, and (iii) selectively assign the local node type as an 
aggregation type based on a neighbor node being of the raw 
type, wherein a node of the raw type sends raw data to a node 
of the aggregation type, wherein the node of the aggregation 
type aggregates and compresses raw data. 

9. The apparatus of claim 7, wherein the first pilot signal 
comprises an indication of the local power level, wherein the 
one or more second pilot signals each comprise an indication 
of a power level, and wherein the processor electronics are 
configured to transmit an acknowledgement message based 
on a maximum power level indicated by the one or more 
received second pilot signals, and receive one or more 
acknowledgements from the one or more detected nodes. 

10. The apparatus of claim 7, wherein the processor elec- 
tronics are configured to (i) cause an increase of the local 
power level to increase the radio range to reach one or more 
additional nodes and (ii) control a transmission of an addi- 
tional signal at the increased local power level to cause the 
one or more additional nodes to become an aggregation type, 
wherein the local node type is a raw type, wherein a node of 
the aggregation type aggregates and compresses raw data. 

11. The apparatus of claim 10, wherein the node of the 
aggregation type is configured to perform a portion of a 
distributed wavelet transform to generate compressed data. 

12. The apparatus of claim 7, wherein the processor elec- 
tronics are configured to perform an approximation-based 
distributed minimum set cover modification algorithm to 
change one or more node type assignments within the net- 
work to increase a data compression ratio. 

13. The apparatus of claim 12, wherein the approximation- 
based distributed minimum set cover modification algorithm 
includes increasing the local power level to increase the radio 
range to reach one or more additional nodes. 

14. A system comprising: 

one or more first nodes, comprising first processor elec- 
tronics and first transceiver electronics, configured to 
collect data, route data, or collect and route data within 
a wireless sensor network; and 
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a second node, comprising second processor electronics 
and second transceiver electronics, configured to collect 
data, route data, or collect and route data within the 
wireless sensor network, wherein the second processor 
electronics are configured to perform operations com- 5 
prising: 

controlling the second transceiver electronics to trans- 
mit a first pilot signal using a local power level that 
controls a radio range of signals transmitted from the 
second node; 1° 

detecting one or more of the first nodes based on a 
reception, via the second transceiver electronics, of 
one or more second pilot signals from at least a por- 
tion of the first nodes; 

controlling the second transceiver electronics to trans- 15 
mit a local outdegree, the local outdegree being based 
on a quantity of the one or more detected first nodes; 
controlling the second transceiver electronics to receive 
one or more remote outdegrees from the one or more 
detected first nodes; 20 

controlling the second transceiver electronics to 
exchange messages with at least a portion of the one 
or more detected first nodes, wherein the messages 
comprise an indication of a node type of a transmit- 
ting node, the node type being selected from a group 25 
comprising an aggregation type and a raw type, 
wherein a node of the raw type is configured to send 
raw data to a node of the aggregation type, wherein a 
node of the aggregation type is configured to aggre- 
gate and compress raw data; 30 

determining the node type of the second node based on 
information comprising the local outdegree, the one 
or more remote outdegrees, and one or more of the 
messages; and 

adjusting characteristics, including an energy usage 35 
characteristic and a data compression characteristic, 
of the wireless sensor network by transmitting one or 
more additional messages based on (i) selectively 
modifying the local power level and (ii) selectively 
changing the node type of the second node. 40 

15. The system of claim 14, wherein determining the node 
type of the second node comprises (i) detecting, via one or 
more of the messages, whether at least one of the detected first 
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nodes is of the raw type, (ii) selectively assigning the node 
type of the second node as the raw type based on information 
comprising the local outdegree, the one or more remote out- 
degrees, and a lack of a detection that at least one of the 
detected first nodes is of the raw type, and (iii) selectively 
assigning the node type of the second node as an aggregation 
type based on a detection that at least one of the detected first 
nodes is of the raw type. 

16. The system of claim 15, wherein the first pilot signal 
comprises an indication of the local power level, wherein the 
one or more received second pilot signals each comprise an 
indication of a power level, and wherein detecting the one or 
more of the first nodes comprises: 

controlling the second transceiver electronics to transmit 
an acknowledgement message based on a maximum 
power level indicated by the one or more received sec- 
ond pilot signals; and 

controlling the second transceiver electronics to receive 
one or more acknowledgements from at least a portion of 
the first nodes. 

17. The system of claim 15, wherein the node type of the 
second node is the raw type, and wherein adjusting the char- 
acteristics comprises: 

increasing the local power level to increase the radio range 
to reach one or more additional nodes; and 

controlling the second transceiver electronics to transmit 
an additional signal at the increased local power level to 
cause at least a portion of the one or more additional 
nodes to become the aggregation type. 

18. The system of claim 15, wherein the node of the aggre- 
gation type is configured to perform a portion of a distributed 
wavelet transform to generate compressed data. 

19. The system of claim 14, wherein adjusting the charac- 
teristics comprises performing an approximation-based dis- 
tributed minimum set cover modification algorithm to change 
one or more node type assignments of nodes within the sys- 
tem to increase a data compression ratio. 

20. The system of claim 19, wherein performing the 
approximation-based distributed minimum set cover modifi- 
cation algorithm comprises increasing the local power level 
to increase the radio range to reach one or more additional 
nodes. 



